CDS

Accession Number TCMCG017C15465
gbkey CDS
Protein Id OMO92623.1
Location join(106880..107040,107642..107765,107873..107927,108348..108461,108560..108656,108837..108903,109052..109111,109210..109819,110187..110596,110895..111115,111598..111691)
GeneID InterPro:IPR005011
Organism Corchorus olitorius
locus_tag COLO4_17461

Protein

Length 670aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA215141, BioSample:SAMN03160584
db_source AWUE01016343.1
Definition SART-1 protein [Corchorus olitorius]
Locus_tag COLO4_17461

EGGNOG-MAPPER Annotation

COG_category A
Description U4 U6.U5 tri-snRNP-associated protein
KEGG_TC -
KEGG_Module M00354        [VIEW IN KEGG]
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko00002        [VIEW IN KEGG]
ko03041        [VIEW IN KEGG]
ko04121        [VIEW IN KEGG]
KEGG_ko ko:K11984        [VIEW IN KEGG]
EC -
KEGG_Pathway ko03040        [VIEW IN KEGG]
map03040        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGAAGAAAAACCATGAGGATGATTATGAAGGAAGCAGAGATGGAGAGCTTCCTGCAGACTATGAACAGAGTAGAGATAAGGATGAACCTGTGTTAAATGTGGGCAGCAAGGCTGGTGCAGTGGAGGCATCCTCATTAGCACTTGAGCAGCGCATTTTAAGAATGAAAGAAGAGAGATTGAAGAAGAAATCTGATGGTGCTTCAGACATTTTAGCATGGGTTAATAAAAGTCGTAAGCTTGAGAAGGAGAAAGCATTGCAGCTTTCAAAAATTTTTGAGGAGCAGGATAATCTTATTCAAGAGGAAAACGAAGATGAGGATGCTGGTGATCGTGCTACTCATGATCTGGCCGGAGTTAAAGTTCTTCATGGCCTTGACAAAGTGATGGATGGTGGAGCTGTTGTTTTGACACTAAAAGATCAGAGCATACTTGCTAATGGTGACATTAATGAAGATGTTGATATGCTTGAAAATGTTGAAATTGGAGAGCAGAAGCAGCGGGATGACGCTTACAAGGCTGCAAAGAAAAAAACAGGGCTTTATGATGACAAGTTCAATGATGAGCCGGGTTCACAGAAAAAAGTACTGCCACAATATGATGATCCAGTTGCAGATGAGGGGATAACTCTGGATGAAAGAGGGCGCTTTTCTGGTGAAGCGGAAAAGAAATTGGAGGAGCTCCGTAAAAGGTTACAAGCTGCTCCCACGAATAACCGTGTTGAAGATCTTGCTAGTGCTGTGAAGATTTCATCAGATTATTATACCCAAGAGGAAATGGTTAAGTTTAAAAAGCCCAAGAAAAAGAAAGCTTTGCGGAAGAAAGACAAGTTGGATATAGATGCCCTTGAAGCGGAAGCTATCTCTTCCGGGCTAGGTGCTGGAGATCTTGGTTCAAGAAATGATGCTAGAAGACAGGCAACTAAAGAGGAGGAGGCCAAATCTGAGGCTGAAAAGAGAAACAGTGCATACCAGTCAGCATATGCCAAGGCAGATGAGGCATCTAAATCACTGCGTACTGAACAAACTCTTACGGTTAAATCCGAGGAAGATGAGAACCAAGTCTTTGCTGATGATGAGGAGGATCTTTATAAATCCCTTGAGAGAGCAAGGAAATTAGCTCTTAAAAAGCAAGAAGAAAAATCAGGTCCCCAAGCTATTGCGCTCCTTGCTACTACAGCTGTTACCACTCAAACTGCAGAGGATCAAAGTAACACAACTGGAGAGGCACAAGAAAGACTTGTAATCTCAGAGATGGAAGAGTTTGTAATGGGCATTCAGCTTGATGAAGAAGCTCATAAGCCGAGCAGCGAAGATGTTTTCATGGATGAGGATGAAGTGCCCGGAGCTCCTGAACATGATGGGGAAAATGGAGAAAATGAGGCCGGTGGATGGAAAGAAGTAGTTGATGCTAGTCCTGATGAAAAACCTGCTAACGAGGACAAGGATGAAATTGTTCCTGATGAAACAATCCACGAAGTTGCAGTGGGTAAAGGACTAGCAGGTGCACTGAAGCTGCTTAAAGATCGAGGAACACTTAAAGAAACTATTGAATGGGGTGGCAGGAACATGGACAAGAAAAAGAGCAAACTTGTTGGTATTGTAGATGATAATCGTGAAAATGATAGATTTAAAGATAATCGCGAAAATGATAGATTTAAAGATATTCGCATTGAGAGGACAGATGAATTTGGTCGAATTATGACACCCAAGGAAGCCTTTCGGAATCTTTCTCATAAATTCCATGGTAAGGGTCCAGGCAAAATGAAGCAAGAGAAACGGATGAAGCAATATCAGGAAGAATTGAAGCTGAAGCAAATGAAAAATTCAGATACACCTTCACTGTCGGTGGAGAGGATGAGAGAAGCTCAAAGTCAGCTGAAAACGCCCTACCTTGTCCTTAGTGGCCATGTCAAACCAGGGCAAACAAGTGATCCTAGAAGTGGCTTTGCTACTGTTGAGAAGGATCTTCCAGGAGGCTTGACACCCATGCTTGGTGATAGAAAGGCAAGTTAA
Protein:  
MKKNHEDDYEGSRDGELPADYEQSRDKDEPVLNVGSKAGAVEASSLALEQRILRMKEERLKKKSDGASDILAWVNKSRKLEKEKALQLSKIFEEQDNLIQEENEDEDAGDRATHDLAGVKVLHGLDKVMDGGAVVLTLKDQSILANGDINEDVDMLENVEIGEQKQRDDAYKAAKKKTGLYDDKFNDEPGSQKKVLPQYDDPVADEGITLDERGRFSGEAEKKLEELRKRLQAAPTNNRVEDLASAVKISSDYYTQEEMVKFKKPKKKKALRKKDKLDIDALEAEAISSGLGAGDLGSRNDARRQATKEEEAKSEAEKRNSAYQSAYAKADEASKSLRTEQTLTVKSEEDENQVFADDEEDLYKSLERARKLALKKQEEKSGPQAIALLATTAVTTQTAEDQSNTTGEAQERLVISEMEEFVMGIQLDEEAHKPSSEDVFMDEDEVPGAPEHDGENGENEAGGWKEVVDASPDEKPANEDKDEIVPDETIHEVAVGKGLAGALKLLKDRGTLKETIEWGGRNMDKKKSKLVGIVDDNRENDRFKDNRENDRFKDIRIERTDEFGRIMTPKEAFRNLSHKFHGKGPGKMKQEKRMKQYQEELKLKQMKNSDTPSLSVERMREAQSQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDLPGGLTPMLGDRKAS